CDS

Accession Number TCMCG075C29312
gbkey CDS
Protein Id XP_017984270.1
Location join(2931924..2932499,2932943..2933254)
Gene LOC18507147
GeneID 18507147
Organism Theobroma cacao

Protein

Length 295aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018128781.1
Definition PREDICTED: ankyrin-1 [Theobroma cacao]
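
The Location and Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the location string uses inclusive 1-based `start..end` segments as in GenBank-style feature tables; the helper name `cds_length` is hypothetical and not part of any tool referenced in this record.

```python
import re


def cds_length(location: str) -> int:
    """Sum the lengths of all inclusive start..end segments in a location string."""
    segments = re.findall(r"(\d+)\.\.(\d+)", location)
    # Each segment is inclusive on both ends, hence the +1.
    return sum(int(end) - int(start) + 1 for start, end in segments)


location = "join(2931924..2932499,2932943..2933254)"
nt = cds_length(location)   # 576 + 312 = 888 nucleotides
codons = nt // 3            # 296 codons, including the stop codon
print(nt, codons, codons - 1)  # prints: 888 296 295
```

Under that assumption, the two exons total 888 nt, that is 296 codons, or 295 residues once the stop codon is excluded, which is consistent with the reported Length of 295aa.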

EGGNOG-MAPPER Annotation

COG_category V
Description Ankyrin repeat-containing protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko03036
KEGG_ko ko:K21435
EC -
KEGG_Pathway -
GOs GO:0001101
GO:0002376
GO:0006950
GO:0006952
GO:0006955
GO:0008150
GO:0009719
GO:0009725
GO:0009751
GO:0010033
GO:0014070
GO:0042221
GO:0042493
GO:0045087
GO:0046677
GO:0050896
GO:1901700

Sequence

CDS:  
ATGGATGAAAGGTTGAGGAGTGCAGCTCTATCAGGAAATATAGATGCCTTGTATTATTTAATTCGAGATGACGCAGATGTTTTACAACGCATCGATGAGATGGCGTTTGTTGATACTCCACTGCACATAGCTGCAGCTGCAGGGCACACTGATTTTGCAATGGAGTTAATGAACTTAAAGCCATCATTCGCTAGGAAGGTCAACCAATGCGGATTTAGCCCTCTTCACCTAGCCTTGCCAAATAAACAAGAAAAGATGGTGGCTCATCTCCTGTTAATTGATAAAGATCTTGTTCGTGTCAAAGGGAGGGAGGGTCACACTCCTCTTCATCATGCAGCCAAGGAAGGAAATGTTCCTCTTCTGTCTCAATTTCTGGACCAATGCCCCAATTCTATCTTAGATTTGACTATTCGAAAAGATACTGCTGTGCATATTGCAGCACAAAATAATCATTTAGAAGCTTTCAAAGCCATACTGCGACGGCTTCCCACTGTATACGAAGTAAGAATCCTAAACTTAGAGGACAAGGATGGAAACACTGTGTTGCACATAGCCGCATCAAATAACCAACGCCAGATGATCAAACTGTTAATAAAAAGCCAGAAGGTTGATTGGAATAAGGTTAATCAGAGTGGTTTTACAGCTTTGCCTGTCTTGGAAGCACCAGCTGGAGATGACAGCAGAGAGAGTGTGAGCATGCTGAAACATGCCAAAGTTCCACCCTTAATTTTTTTAGGGAAGATGCTTCTTCAAAGTCGGTGTTTTACTGAAATAATAACTGATATTCTGGAAATGAAAACTGATACGATCAATACGTTGCTAGTCGTATTGGCACTGATTCTATCGATGACTTACCAAGCTGTCCTCAGCCCACCGGCTGGTGCTTGA
Protein:  
MDERLRSAALSGNIDALYYLIRDDADVLQRIDEMAFVDTPLHIAAAAGHTDFAMELMNLKPSFARKVNQCGFSPLHLALPNKQEKMVAHLLLIDKDLVRVKGREGHTPLHHAAKEGNVPLLSQFLDQCPNSILDLTIRKDTAVHIAAQNNHLEAFKAILRRLPTVYEVRILNLEDKDGNTVLHIAASNNQRQMIKLLIKSQKVDWNKVNQSGFTALPVLEAPAGDDSRESVSMLKHAKVPPLIFLGKMLLQSRCFTEIITDILEMKTDTINTLLVVLALILSMTYQAVLSPPAGA
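
The relationship between the CDS and Protein sequences above can be illustrated by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) applies to this gene; the names `CODON_TABLE` and `translate` are hypothetical, and only short fragments copied from the start and end of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATGAAAGGTTGAGG"))  # first six codons of the CDS -> MDERLR
print(translate("CCGGCTGGTGCTTGA"))     # last five codons of the CDS -> PAGA
```

Both fragments agree with the corresponding ends of the Protein sequence (`MDERLR...` and `...PAGA`), with the terminal `TGA` acting as the stop codon.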